On - Line Learning from Restricted Training Sets
نویسندگان
چکیده
{ We analyse the dynamics of on-line learning in multilayer neural networks where training examples are sampled with repetition and where the number of examples scales with the number of network weights. The analysis is based on monitoring a set of macroscopic variables from which the training and generalisation errors can be calculated. A closed set of dynamical equations is derived using the dynamical replica method and is solved numerically. The theoretical results are consistent with those obtained by computer simulations. Layered neural networks are powerful nonlinear information processing systems, capable of implementing arbitrary continuous and discrete input-output maps to any desired accuracy 1], given a suucient number of hidden nodes and a suuciently large example set. They have been employed successfully in a variety of regression and classiication tasks, and have been studied using a wide range of methods (for a review see 2]). On-line learning refers to the iterative modiication of the network parameters according to a predetermined training rule, following successive presentations of single training examples, each representing a speciic input vector and the corresponding output. On-line learning is one of the leading techniques in training large neural networks, especially via gradient descent on a diierentiable error measure. Considerable progress has been made recently in analysing the dynamics of supervised on-line learning in layered neural networks via methods of statistical physics (reviews can be found in 3] and 4]). Most of the analyses (e.g. 5, 6, 7]) have concentrated on the case of uncorrelated innnite training sets, where training examples are sampled without repetition and in which there is no correlation between the network parameters and the examples presented at each training step. However, a more realistic scenario is that where the number of training examples scales with the number of free parameters, and are sampled with repetition. This gives rise to correlations between the network parameters and the training examples, which clearly aaect the learning process. One of the most signiicant aspects of having a xed Typeset using EURO-L A T E X
منابع مشابه
Dynamics of on-line Hebbian learning with structurally unrealizable restricted training sets
We present an exact solution for the dynamics of on-line Hebbian learning in neural networks, with restricted and unrealizable training sets. In contrast to other studies on learning with restricted training sets, unrealizability is here caused by structural mismatch, rather than data noise: the teacher machine is a perceptron with a reversed wedge-type transfer function, while the student mach...
متن کاملOn-Line Learning with Restricted Training Sets: An Exactly Solvable Case
We solve the dynamics of on-line Hebbian learning in large perceptrons exactly, for the regime where the size of the training set scales linearly with the number of inputs. We consider both noiseless and noisy teachers. Our calculation cannot be extended to non-Hebbian rules, but the solution provides a convenient and welcome benchmark with which to test more general and advanced theories for s...
متن کاملOn-Line Learning with Restricted Training Sets: Exact Solution as Benchmark for General Theories
We solve the dynamics of on-line Hebbian learning in perceptrons exactly, for the regime where the size of the training set scales linearly with the number of inputs. We consider both noiseless and noisy teachers. Our calculation cannot be extended to non-Hebbian rules, but the solution provides a nice benchmark to test more general and advanced theories for solving the dynamics of learning wit...
متن کاملA Hybrid Algorithm based on Deep Learning and Restricted Boltzmann Machine for Car Semantic Segmentation from Unmanned Aerial Vehicles (UAVs)-based Thermal Infrared Images
Nowadays, ground vehicle monitoring (GVM) is one of the areas of application in the intelligent traffic control system using image processing methods. In this context, the use of unmanned aerial vehicles based on thermal infrared (UAV-TIR) images is one of the optimal options for GVM due to the suitable spatial resolution, cost-effective and low volume of images. The methods that have been prop...
متن کاملOn-Line Learning of a Persian Spoken Dialogue System Using Real Training Data
The first spoken dialogue system developed for the Persian language is introduced. This is a ticket reservation system with Persian ASR and NLU modules. The focus of the paper is on learning the dialogue management module. In this work, real on-line training data are used during the learning process. For on-line learning, the effect of the variations of discount factor (g) on the learning speed...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007